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SEQUENCE LISTING 


(1) GENE! 


INFORMATION : 


(i) APPLICANT 


MOORE, PAUL A. 
RUBEN, STEVEN M 
EBNER , REINHARD 


(ii) TITLE OF INVENTION: TISSUE PLASMINOGEN ACTIVATOR-LIKE PROTEIN 

( iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : HUMAN GENOME SCIENCES, INC. 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CSTY: ROCKVILLE 

(D) STATE: MD 

( E ) COUNTRY : USA 

(F) ZI3 : 20850 

(v) COMPUTER\ READABLE FORM 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AG&NT INFORMATION: 

(A) NAME: BROOKES, ANDERS A. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENGE/DOCKET NUMBER: PF378 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: \(301) 309-8512 


(2) INFORMATION FOR SEQ ID NO : 1 : 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2329 base pairs 

(B) TYPE: nucleAc acid 

(C) STRANDEDNE S S\ single 

(D) TOPOLOGY: liAear 

(ii) MOLECULE TYPE: DNA\ (genomic) 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124 . . 9M3 

( ix) FEATURE : 

(A) NAME/KEY: sig_pepftide 

(B) LOCATION: 124 .. 184' 


(ix) FEATURE: 

(A) NAME/KEY: mat_pepti< 

(B) LOCATION: 187.. 913 



\ 

TTACCAGAAOv AGCATAACAA GGGCAGGTCT GACTGCAAGC TGGGACTGGG AGGCAGAGCC 

GCCGCCAAGG G^GCC TCGGT TAAACACTGG TCGTTCAATC ACCTGCAAGA CGAAGAGGCA 

AGG ATG CTG T r 
Met Leu 

-21 -20 \ -15 

CTA GCA GAA GCC 
Leu Ala Glu Ala 
-5 


CTG TAC CGG GAG GAC 
Leu Tyr Arg Glu Asp 

15 


20 


35 


ccc 

TGG 

TGC 

TAC 

GTC 

AGT 

GGC 

GAG 

GCC 

Pro 

Trp 

60 

Cys 

Tyr 

Val 

Ser 

Gly 

65 

Glu 

Ala 

TGC 

GAG 

GAC 

CTG 

CGC 

TGT 

CCA 

\ 

GAG 

ACC 

Cys 

75 

Glu 

Asp 

Leu 

Arg 

Cys 

80 

Pro 

Glu\ 

, Thr 

TTC 

ACG 

ACA 

GAA 

ATC 

CAG 

GAA 

GCG 

\ 

TCT 

Phe 

Thr 

Thr 

Glu 

lie 

95 

Gin 

Glu 

Ala 

s4r 

\ 

GTG 

CAG 

GTG 

TTC 

GCT 

CCT 

GCC 

AAC 

GCC 

Val 

Gin 

Val 

Phe 

110 

Ala 

Pro 

Ala 

Asn 

Ala 

115 

GCA 

GCT 

GTG 

CAG 

CCA 

GTG 

ATT 

GGG 

ATC 

Ala 

Ala 

Val 

125 

Gin 

Pro 

Val 

lie 

Gly 

130 

lie 

TCC 

AAG 

GAG 

AAA 

AAG 

GAC 

CTG 

GGA 

ACT 

Ser 

Lys 

140 

Glu 

Lys 

Lys 

Asp 

Leu 

145 

Gly 

Thr 

ACC 

ATG 

ATG 

GTG 

ATC 

ATC 

ATT 

GCC 

ATC 

Thr 

155 

Met 

Met 

Val 

lie 

lie 

160 

lie 

Ala 

lie 

TAC 

TCC 

TAC 

AAG 

AGG 

GGG 

AAG 

GAT 

TTG 

Tyr 

Ser 

Tyr 

Lys 

Arg 

175 

Gly 

Lys 

Asp 

Leu 

GTA 

TGT 

GAG 

AGG 

GAG 

ATG 

CAG 

CGA 

ATC 

Val 

Cys 

Glu 

Arg 

190 

Glu 

Met 

Gin 

Arg 

lie 

195 

ACC 

AAC 

CCC 

ACC 

TGT 

GAG 

ATT 

GTG 

GAT 

Thr 

Asn 

Pro 

205 

Thr 

Cys 

Glu 

lie 

Val 

210 

Asp 


85 


100 
:tg 

»eu 


\ 


AG< 
Ser 


CTG 

Leu 


GGA 

Gly 


AAA 

Lys 

180 

ACT 

Thr 


GAG 

Glu 


\ 


165 


GTC 

AGC 

AAC 

ATG 

CTC 

Val 

Ser 

-10 

Asn 

Met 

Leu 

TGG 

GAC 

AAC 

GGC 

CAC 

Trp 

Asp 

Asn 

Gly 

His 

10 

GGC 

CTC 

CGC 

TGC 

CTC 

Gly 

Leu 

Arg 

Cys 

25 

Leu 

GCC 

CCC 

GTG 

TCG 

GGG 

Ala 

Pro 

Val 

40 

Ser 

Gly 

GAG 

GAC 

CCG 

CGC 

GGG 

Glu 

Asp 

55 

Pro 

Arg 

Gly 

CCT 

GAG 

AAA 

CGG 

CCT 

Pro 

Glu 

Lys 

Arg 

Pro 

70 

CAG 

GCC 

CTG 

CCA 

GCC 

Gin 

Ala 

Leu 

Pro 

Ala 

90 

CCA 

GGT 

GCA 

GAT 

GAG 

Pro 

Gly 

Ala 

Asp 

105 

Glu 

GCT 

CGG 

AGT 

GAG 

GCG 

Ala 

Arg 

Ser 

120 

Glu 

Ala 

CGG 

GTG 

CGG 

ATG 

AAC 

Arg 

Val 

135 

Arg 

Met 

Asn 

TAC 

GTG 

CTG 

GGC 

ATT 

Tyr 

Val 

Leu 

Gly 

lie 

150 

V3GC 

ATC 

ATC 

TTG 

GGC 

Cjly 

V 

\ 

lie 

lie 

Leu 

Gly 

170 

CAG\ 

Gin 

CAT 

GAT 

CAG 

AAA 

^is 

\ 

Asp 

Gin 

185 

Lys 

CCC 

% 

TTG 

TCT 

GCC 

TTC 

Pro 

Leu\ 

Ser 

N^oo 

\ 

Ala 

Phe 

ACT 

GTC 

GTG 

GTC 

CAC 


Lys Thr Val Va 
215 


60 

120 

168 

216 

264 

312 

360 

408 

456 

504 

552 

600 

648 

696 

744 

792 

840 




60 


\ 


ACC 

AGC 

CAG 

ACT 

CCA 

GTT 

GAC 

.CCT 

CAG 

GAG 

GGC 

AGC 

ACC 

CCC 

CTT 

ATG 

Thr 

S^r 

Gin 

Thr 

Pro 

Val 

Asp 

Pro 

Gin 

Glu 

Gly 

Ser 

Thr 

Pro 

Leu 

Met 


22 0\ 

s. 




225 





230 





GGC 

CAG 

see 

GGG 

ACT 

CCT 

GGG 

GCC 

T GAGCCCCCCC AGTGGG C AGG 



Gly 

Gin 

ma 

Gly 

Thr 

Pro 

Gly 

Ala 









235 


\ 



240 




- 








AGCCCATGCA 'GACACTGGTG CAGGACAGCC CACCCTCCTA C AG C TAG GAG GAACTACCAC 
TTTGTGTTCT gVtTAAAACC CTACCACTCC CCCGCTTTTT TGGCGAATCC TAGTAAGAGT 
GACAGAAGCA GGTK3GCCCTG TGGGCTGAGG GTAAGGCTGG GTAGGGTCCT AACAGTGCTC 
CTTGTCCATC C C TTG GAG C A GATTTTGTCT GTGGATGGAG ACAGTGGCAG CTCCCACAGT 
GATGCTGCTG CTAAGGGCTT C CAAACATTG CCTGCACCCC TGGAACTGAA CCAGGGATAG 
ACGGGGAGCT CCCCCAOGCT CCTCTGTGCT TTACTAAGAT GGCTCAGTCT CCACTGTGGG 
CTTGAGTGGC ATACACTOTT ATTCATGGTT AAGGTAAAGC AGGTCAAGGG ATGGCATTGA 
AAAAATATAT TTAGTTTTTA AAATATTTGG GATGGAACTC CCTACTGACC TCTGACAACT 
GGAAACGAGT TTGTACTGAaX GTCAGAACTT TGGGTTGGGA ATGAGATCTA GGTTGTGGCT 
GCTGGTATGC TTCAGCTTGC TGGCAATGAT GTGCCTTGAC AAC CGTGGGC CAGGCCTGGG 
CCCAGGGACT CTTCCTGTTT C^TAAGGAAA GGAAGAATTG CACTGAGCAT TCCACTTAGG 
AAGAGGATAG AGAAGGATCT GCTCCGCCTT TGGCCACAGG AGCAGAGGCA GACCTGGGAT 
GCCCCAGTTT CTCTTCAGGG ATGGATAGTG ACCTGTCTTC ATTTTGCACA GGTAAGAGAG 
TAGTTAG CTA ACCTATGGGA ATTATACTGT GGGGCCTTGT GAGCTGCTTC TAAGAGGCTA 



AC CTGGAAAC TAAGCTCAGA GGCAAGGTAA TAAAGCACTT CAGGGCTTGC TCCCCAAGTG 

\ 

\ 

GGCCTGATTT AG CAGGTGGT CTGCGGGCGT C C AG GT C AG C ACCTTCCTGT AGGGCACTGG 

\ 


GGCTAGGGTC ACAGCCCCTA AC 


TCATAA^ 


G CAATCAAAGA AC C ATTAGAA AGGGCTCATT 


\ 

AAGCCTTTTG gacacaggac cccagagagg aaaaagtgac ttgcccaagg TCGTAAGCAA 

GCTACTGGCA TGGCAAGAGC CCAGCTTCCT\ GACGGAGCGC AACATTTCTC CACTGCACTG 

TGCTAGCAGC TCAGCAGGGC CTCTAACCTG TGATGTCACA CTCAAGAGGC CTTGGCAGCT 

C C TAG C CAT A GAGCTTCCTT TCCAGAACCC TT^CCACTGCC CAATGTGGAG ACAGGGGTTA 

\ 

GTGGGGCTTT CTATGGA GCC ATCTGCTTTG GGGAC CTAG A CCTCAGGTGG TCTCTTGGTG 


t'c 


TTAGTGATGC TGGAGAAGAG AATATTACTG GTTTCTACTT TTCTATAAAG GCATTTCTCT 

\ 

\ 

\ 


ATAAAAAAAA AAAAAA 


(2) INFORMATION FOR SEQ ID NO : 2 : 


\ 


\ 

(i) SEQUENCE CHARACTERISTICS: \ 

(A) LENGTH: 263 amino acids \ 

(B) TYPE: amino acid \ 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


l 

* 

\ 




\ 


888 

933 

993 

1053 

1113 

1173 

1233 

1293 

1353 

1413 

1473 

1533 

1593 

1653 

1713 

1773 

1833 

1893 

1953 

2013 

2073 

2133 

2193 

2253 

2313 

2329 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Leu Le\i Ala Trp Val Gin Ala Phe Leu Val Ser Asn Met Leu Leu 

-21 -20 \ -15 -10 

Ala Glu Ala \yr Gly Ser Gly Gly Cys Phe Trp Asp Asn Gly His Leu 

-5 \ 1 5 10 

Tyr Arg Glu As® Gin Thr Ser Pro Ala Pro Gly Leu Arg Cys Leu Asn 

l^K 20 25 

Trp Leu Asp Ala yin Ser Gly Leu Ala Ser Ala Pro Val Ser Gly Ala 

30 \ 35 40 

Gly Asn His Ser Tyr Cys Arg Asn Pro Asp Glu Asp Pro Arg Gly Pro 

45 \ 50 55 

Trp Cys Tyr Val Ser\Gly Glu Ala Gly Val Pro Glu Lys Arg Pro Cys 

60 \ 65 70 75 

Glu Asp Leu Arg Cys Pbro Glu Thr Thr Ser Gin Ala Leu Pro Ala Phe 

80 \ 85 90 

Thr Thr Glu lie Gin Gl\ Ala Ser Glu Gly Pro Gly Ala Asp Glu Val 

95 \ 100 105 

Gin Val Phe Ala Pro Ala ysn Ala Leu Pro Ala Arg Ser Glu .Ala Ala 

110 \ 115 120 

Ala Val Gin Pro Val lie Gly lie Ser Gin Arg Val Arg Met Asn Ser 

125 iyO 135 

Lys Glu Lys Lys Asp Leu Gl\\ Thr Leu Gly Tyr Val Leu Gly Tie Thr 

140 145 \ 150 155 

Met Met Val lie lie lie Ala \le Gly Ala Gly lie lie Leu Gly Tyr 

160 \ 165 170 

Ser Tyr Lys Arg Gly Lys Asp Leu Lys Glu Gin His Asp Gin Lys Val 

175 \ 180 185 

Cys Glu Arg Glu Met Gin Arg lie V'hr Leu Pro Leu Ser Ala Phe Thr 

190 195 \ 200 

Asn Pro Thr Cys Glu lie Val Asp G\u Lys Thr Val Val Val His Thr 

205 210 \ 215 

Ser Gin Thr Pro Val Asp Pro Gin GlmGly Ser Thr Pro Leu Met Gly 
220 225 \ 230 235 

Gin Ala Gly Thr Pro Gly Ala \ 

240 \ 

(2) INFORMATION FOR SEQ ID NO: 3: \ 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino .acid 

(C) STRANDEDNESS': single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 


Tyr 

1 

Val 

\Phe 

Lys 

Ala 

5 

Gly 

Lys 

Tyr 

Ala 

Cys 

S^r 

Glu 

20 

Gly 

Asn 

Ser 

Asp 

Tyr 

Arg 

Gly 

35 

Vhr 

His 

Ser 

Leu 

Thr 

40 

Trp 

Asn 

50 

Ser 

Met 

lie 

V 

Leu 

lie 

55 

Gly 

Ser 

65 

Ala 

Gin 

Ala 

Deu 

Gly 

70 

Leu 

Gly 

Asp 

Gly 

Asp 

Ala 

Lys'' 

85 

\Pro 

Trp 

Cys 

Thr 

Trp 

Glu 

Tyr 

100 

Cys 

Asp 

Val 

Pro 

Gin 

Tyr 

Ser 

115 

Gin 

Pro 

Gin 

\phe 

Arg 

120 

lie 

Ala 

130 

Ser 

His 

Pro 

Trp 

Git 

135\ 

Ala 

Ser 

145 

Pro 

Gly 

Glu 

Arg 

Phe 

150 

Leu 

uys 

Trp 

He 

Leu 

Ser 

Ala 

165 

Ala 

His 

Cys' 

His 

Leu 

Thr 

Val 

180 

lie 

Leu 

Gly 

Arg 

Glu 

Glu 

Gin 

195 

Lys 

Phe 

Glu 

Val 

Glu 

200 

Asp 

Asp 

210 

Asp 

Thr 

Tyr 

< 

Asp 

Asn 

215 

Asp 

Asp 

225 

Ser 

Ser 

Arg 

Cys 

Ala 

230 

Gin 

Glu 

Leu 

Pro 

Pro 

Ala 

Asp 

245 

Leu 

Gin 

Leu 

Ser 

Gly 

Tyr 

Gly 

260 

Lys 

His 

Glu 

Ala 

Leu 

Lys 

Glu 

275 

Ala 

His 

Val 

Arg 

Leu 

280 

Gin 

His 

290 

Leu 

Leu 

Asn 

Arg 

Thr 

295 

Val 

Asp 

305 

Thr 

Arg 

Ser 

Gly 

Gly 

310 

Pro 

Gin 

Gly 

Asp 

Ser 

Gly 

Gly 

325 

Pro 

Leu 

Val 


Ser 

Ser 

10 

Glu 

Phe 

Cys 

Ser 

Thr 

15 

Pro 

Cys 

25 

Tyr 

Phe 

Gly 

Asn 

Gly 

30 

Ser 

Ala 

Glu 

Ser 

Gly 

Ala 

Ser 

45 

Cys 

Leu 

Pro 

Lys 

Val 

Tyr 

Thr 

60 

Ala 

Gin 

Asn 

Pro 

Lys 

His 

Asn 

75 

Tyr 

Cys 

Arg 

Asn 

Pro 

80 

His 

Val 

90 

Leu 

Lys 

Asn 

Arg 

Arg 

95 

Leu 

Ser 

105 

Cys 

Ser 

Thr 

Cys 

Gly 

110 

Leu 

Arg 

lie 

Lys 

Gly 

Gly 

Leu 

125 

Phe 

Ala 

Asp 

Ala 

lie 

Phe 

Ala 

140 

Lys 

His 

Arg 

Arg 

Gly 

Gly 

lie 

155 

Leu 

lie 

Ser 

Ser 

Cys 

160 

Phe 

Gin 

170 

Glu 

Arg 

Phe 

Pro 

Pro 

175 

His 

Thr 

189\ 

Tyr 

Arg 

Val 

Val 

Pro 

190 

Gly 

Glu 

Lys 

V r 

lie 

Val 

His 

205 

Lys 

Glu 

Phe 

lie 

Ala\ 

Leu 

Leu 

220 

Gin 

Leu 

Lys 

Ser 

Ser 

Ser 

v\i 

23^ 

\ 

Val 

Arg 

Thr 

Val 

Cys 

240 

Pro 

Asp 

250 

Trp 

Vhr 

Glu 

Cys 

Glu 

255 

Leu 

Leu 

265 

Ser 

Pro 

Ph\ 

Tyr 

\ 

Ser 

270 

Glu 

Arg 

Tyr 

Pro 

Ser 

Ser 

j\g 

285i 

\ 

Cys 

Thr 

Ser 

Thr 

Asp 

Asn 

Met 

300 

Leu 

\ys 

Ala 

Gly 

Ala 

Asn 

Leu 

315 

His 

Asp 

Al\ 

Cys 

i 

Gin 

320 

Cys 

Leu 

330 

Asn 

Asp 

Gly 

Arg 

Met 

3v5 

Thr 


Leu Val Gly lie lie Ser Trp Gly Leu Gly Cys Gly Gin Lys Asj\Val 


r i 


r ^ 


340 


345 


350 


Pro Gly Val Tyr Thr Lys Val Thr Asn Tyr Leu Asp Trp lie Arg Asp 
3B5 360 365 

Asn Met Arxf Pro 
370 \ 


(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE\CHARACTERISTICS : 

(A) LENGTH: 250 base pairs 

(B) TYPE :\ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE\: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

ATTGCACTGA GCATTCCACT TAGGAAGAGG ATAGAGAAGG ATCTGCTCCG CCTTTGGCCA 

\ 

CAGGAGCAGA GGCAGACCTG GG ATCjC C C C A TTTCTCTTCA GGGATGGATA GTGACCTGTC 


TTCATTTTGC ACAGGTAAGA GAGTAGTTAG CTAACCTATG GGAATTATAC TGTGGGGCCT 
TGTAGCTGCT TCTAAGAGGC TAACCTG^AA ACTAAGCTCA GAGGCAAGGT AATAAAGCAC 
TTCAGGGCTT \ 


INFORMATION FOR SEQ ID NO:5:\ 

\ 

(i) SEQUENCE CHARACTERISTICS \ 

(A) LENGTH: 247 base pairs 

(B) TYPE: nucleic acid \ 

(C) STRANDEDNESS: single \ 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 


i 

r\: 5 : 


(xi) SEQUENCE DESCRIPTION: SEQ ID NOX: 5 : 

ATAGAGAAAT GCCTTTATAG AAAAGTAGAA ACCAGTAJV'rA TTCTCTTCTC CAGCATCACT 

\ 

\ 

AACACCAAGA GACCACCTGA GGTCTAGGTC CCCAAAGCA(^ATGGCTCCAT AGAAAGCCCC 
ACTAACCCGT CTCCACATTG GGCAGTGGAA GGGTTCTGGA \aGGAAGCTC TATGGCTAGG 
AGCTGCCAAG GCCTCTTGAG TGTGACATCA CAGGTTAGAG GC£CTGCTGA GCTGCTAGCA 
CAGTGCA \ 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




T 


] 




64 


(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCER DESCRIPTION: SEQ ID NO : 6 : 

AATTCGGCAA GAGTAACAGC ATAACAAGGG TAG GT C TGAC TGCAGCTGGG ACTGGGAGGC 
AGAGCACGCC AAGGGGGcW CGGTTAAACA CTGGTCGTTC AATCACCTGC AAACGAGGAG 
GCAAGGATGC TGTTGGCCTG GGTACAGCAT TCCTGGTCAG CAACATGCTC CTAGCGTAAG 
C CTATGGATC TGGAGGCTGT\TTCTGGGACA ACGGCCACTG TACCCGGAGG ACCAGACCTT 

CCCGGCCGGT CCTCGTGCCT GAACTGGCTG GACGCGCAGG GCTGCCTGGG CCCCCTTTTC 

\ 

GGTCAAATTT CACAGTTTAC TTCGAAACCG GGACGGGGCC GTGGGGGCCC TGGTGGTTAG 

\ 

TTTGGGGTCG GGTTTTCTTA AA^AAGGTTT TTGGGGCCGG TTTTCGGAAC CATTTCGGTT 
GAATTTTTTA GGGAAATTTC AGG^GTTTTT TAAGGG C CAT T 
(2) INFORMATION FOR SEQ Id\nO:7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 99 ba'se pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: s\ingle 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: S^Q ID NO : 7 : 

v 

GGCAAGTTGC AGAACTGGAA ACGAGTTTGT A^AGAAGTCA 

GATCTAGGTT GTGGCTGCTG GTATGCTTCA TT^GCTGGCAA 

GGGCCAGGCC TG G G AC C AG G GACTCTTCCT GTI^TCATAAG 

GCATTCCACT TAGGAAGAGG ATAGAGCAAG GAAT^CTGCTC 

AGGCAGACCT GGGATGCCCC AGTTCTCTTT CAGGGATGGG 

\ 

TTGCACAGGT AAAGAGAGTT AGTTAGCTAA CCTATT^GGGC 

\ 

AGCTGCTTTT TAAGAGGTTA ACCTGGAACT AAAGTTGAG 


GAACTTTGGG TTAGGAATGA 
TAATGTGCCT TGACAACCGT 
GAAAGGAAGA ATTGCACTGA 
CGCTTTGGCC ACAGGAGCAG 
ATAGTGAC C T GTCTTACATT 
TTTATTACTT GGGG CTTGTG 


(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


\ 


\ 


\ 


\ 


\ 


\ 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 


\ 


60 

120 

180 

240 

300 

360 

420 

461 


60 

120 

180 

240 

300 

360 

399 


TAATTKGGCA 


AGGGACAGGT 

CTGACTGCAG 

CTGGGACTGG 

GAGGCAGAGC 

CGTCAAGGGG 

60 

AACACTGGTC 

GTTCAATCAC 

CTGCAACGAG 

AGGCAAGGAT 

GCTGTTGGCC 

120 

CATTCCTGTC 

AG C AACATG C 

T C C TAG C AG A 

AAGCCTATGG 

ATCTGGGAGG 

180 

teACAACGGCC 

ACCTGTACCG 

GAGGACCAGA 

CCTCCCCGGC 

CGGGCCTTCC 

240 

aVtggtttga 

CGTGGCAAAG 

GGGCTTGTCT 

GGCCCTTTTG 

GGGGAAAATT 

300 

ATTGTCCCGG 

AAAAC CTGGA 

GAGG 



334 


(2) INFORMATION\FOR SEQ ID NO: 9: 

(i) SEQUENCE^ CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPe\ nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLQGY: linear 


(ii) MOLECULE TY] 


DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

AATTCGGCAG AGGGAGAGGG AgAtGCAGCG AATCACTCTG CCCTTGTCTG CCTTCACCAA 
CCCCACCTGT GAGATTGTGG ATGAGAAGAC TGTCGTGGTC CACACCAGCC AGACTCCAGT 
TGACCCTCAG GAGGGCAGCA CCCCC^TTAT GGGACCAGGC CGGGGACTCC TGGGGCCTGA 

GCCCCCCAGT GGGGCAGGAG CCATGGOAGA CACTGGTGCA GG AC AG C C AC CCTCCTTACA 

\ 

GCTAGGGGGA ACTACCACTT TGTGTTTC^G GTTTAAAACC CTACCACTCC CGGATTTTTT 
GGCGGATTCC TTAGTTAAGA GTACAGAAGCS AGGTGGGCCT ATGGCTTGGA GGGTAAGGTG 
GGGTAGGGTT C CTAAAAGTG GGTTCTTGGT ^TGCTCCTGGG AGGAAGATTT TGGTTTTGGT 
GGGGACAGTG GCAGTTTCCA CAGGTTGTTG T^TTAAGGGG TTCAAAAAAT TG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 


60 

120 

180 

240 

300 

360 

420 

472 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGCACGAGA TGAACTCCAA GGAGAAAAAG GAC CTGGGAA CTCTGGGTAT GACGGTCCCC 
CACCCCTGCC CTTGTTGGGA TTCATCAAGA GATGTCATTT GCT^jATTGTC TAGGGTGTGG 

CTAATGGGAC CTTGTGTCCT ATCCTTGGCA GGCTACGTGC TGGGCATTAC CATGATGGTG 

\ 

ATCATCATTG CCATCGGAGC TGGCATCATC TTGGGCTACT CTACAAGAGG TCAGTAGCTT 


60 

120 

180 


240 


m;hh «inrit 


r i 


CTCTTCTGGG CGCTCTTAGG AGGAGGGGAG GAAGGTACAC AAAGTCAAAC T 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) UENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE VyPE : DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGCCGACATG TCTGGAGGCTT GTTTCTGG 
(2) INFORMATION FOR SEQ ID NO: 12: 


SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNHSS : single 

(D) TOPOLOGY: Vl inear 


(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCGGAAGCT TATTAGGCCC CAGGAGTCCC GGC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 basd pairs 

(B) TYPE: nucleic acid . 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linearX 

(ii) MOLECULE TYPE: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GGCCGGGATC CGCCATCATG CTGTTGGCCT G.GGTAC 
(2) INFORMATION FOR SEQ ID NO: 14: \ 


(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE \DESCRIPTION: SEQ ID NO: 14: 
GGCCGGGTAC CTTATTAGGC C C CAGGAGTC CCGGC 
(2) INFORMATION FOF8 SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDKDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYP^: DNA (genomic) 


(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15 
GGCCGGGATC CGCCATCATG C'BGTTGGCCT GGGTAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS \ single 

(D) TOPOLOGY: linfear 

(ii) MOLECULE TYPE: DNA \( genomic) 


(xi) SEQUENCE DESCRIPTION A SEQ ID NO: 16 


GGCCGGGTAC CTTATTAGGC C C CAGGAGTC CCGGC 
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